Picture for Pierre Ablin

Pierre Ablin

Ecole normale supérieure, Paris, France

Balanced LoRA: Removing Parameter Invariance to Accelerate Convergence

Add code
May 29, 2026
Viaarxiv icon

Mix, Don't Tune: Bilingual Pre-Training Outperforms Hyperparameter Search in Data-Constrained Settings

Add code
May 13, 2026
Viaarxiv icon

Scaling Laws for Mixture Pretraining Under Data Constraints

Add code
May 12, 2026
Viaarxiv icon

Locking Pretrained Weights via Deep Low-Rank Residual Distillation

Add code
May 11, 2026
Viaarxiv icon

DynaMiCS: Fine-tuning LLMs with Performance Constraints using Dynamic Mixtures

Add code
May 11, 2026
Viaarxiv icon

Nectar: Neural Estimation of Cached-Token Attention via Regression

Add code
May 10, 2026
Viaarxiv icon

Optimal Splitting of Language Models from Mixtures to Specialized Domains

Add code
Mar 19, 2026
Viaarxiv icon

The Design Space of Tri-Modal Masked Diffusion Models

Add code
Feb 25, 2026
Viaarxiv icon

LaCy: What Small Language Models Can and Should Learn is Not Just a Question of Loss

Add code
Feb 13, 2026
Viaarxiv icon

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

Add code
Dec 26, 2025
Viaarxiv icon